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Abstract 

We consider a zero mean discrete time series, and define its discrete Fourier transform at 
the canonical frequencies. It is well known that the discrete Fourier transform is asymptotically 
uncorrelated at the canonical frequencies if and if only the time series is second order stationary. 
Exploiting this important property, we construct a Portmanteau type test statistic for testing 
stationarity of the time series. It is shown that under the null of stationarity, the test statistic 
is approximately a chi square distribution. To examine the power of the test statistic, the 
asymptotic distribution under the locally stationary alternative is established. It is shown to 
be a type of noncentral chi-square, where the noncentrality parameter measures the deviation 
from stationarity. The test is illustrated with simulations, where is it shown to have good 
power. Some real examples are also included to illustrate the test. 

Keywords and phrases Discrete Fourier Transform, linear time series, local stationarity, 
Portmanteau test, test for second order stationarity. 

1 Introduction 



An important assumption that is often made when analysing time series is that it is at least 
second order stationary. Most of the linear time series literature is based on this assumption. If the 
assumption is not properly tested and the analysis is performed, the resulting models are considered 
to be misspecified and any forecasts obtained are not appropriate. Therefore, it is important to 
check whether the time series is second order stationary. 

In recent years several statistical tests have been proposed. Many of the proposed tests are based on 
comparing spectral densities over various segments (Priestley and Subba Rao (1969), Picard (1985), 
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Giraitis and Leipus (1992), Adak (1998) and Paparoditis (2009)) comparing covariance structures 
over various segments of the data (Loretan and Philips (1994), Andreou and Ghysels (2008) and 
Berkes, Gombay, and Horvath (2009)), or comparisons of wavelet coefficients (Sachs and Neumann 
(1999), Cho and Fryzlewicz (2009)). The underlying important assumption, on which these tests 
are based, is on a delicate, subjective, choice of segments of the data. In this paper we propose a 
test based on the discrete Fourier transforms based on the entire length of data, thus avoiding a 
subjective choice of segment length. 

In Section 2 we define the Discrete Fourier transform and show that the Discrete Fourier transforms 
are asymptotically uncorrelated at the canonical frequencies if and only if the time series is second 
order stationary. This motivates the test statistics, which is based on the discrete Fourier transform. 
The Portmanteau type test statistics we propose is based on the covariance function calculated using 
the discrete Fourier transform at the canonical frequencies. The asymptotic sampling distribution of 
the test statistic is obtained in Section 3. Further we show that the asymptotic sampling distribution 
is approximately distributed as a central chi-square under the null hypothesis that the time series is 
second order stationary. To examine the power of the test, we consider the case of locally stationary 
time series (see Subba Rao (1970), Ozaki and Tong (1975), Priestley (1981), Dahlhaus (1997), Nason 
and Sachs (1999), and Nason, Sachs, and Kroisandt (2000)), and derive the distribution of the test 
statistic under this class of alternatives. In Section 4 we show the distribution under this class 
of alternatives, is a type of non-central chi-square, where the noncentrality parameter is in some 
sense a measure of departure of nonstationarity. In Section 5, through simulations, we examine the 
power of the test, and show that for various types of alternatives the power is very high. We end 
this section with various comments on the types of nonstationary behaviour the test can detect. In 
Section 6 we illustrate the test with two real data examples. 

An outline of important aspects of some proofs can be found in the appendix, and the full details 
can be found in the accompanying technical report. 

2 The Test Statistic, motivation and sampling distribution 

2.1 Motivation 

Let {X t } be a zero mean time series. Suppose we observe {X t ; t = 1, . . . , T} and let Jri^k) be its 
Discrete Fourier Transform defined as 
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where to k = are the canonical frequencies. It is well known that if {X t } is a second order 
stationary time series, whose covariances are absolutely summable then for k\ ^ k 2 and k\ ^ T — k 2 
we have cov ( Jt (uJ kl ), JrO^)) = O(^). Therefore in the case of stationary processes, the discrete 
Fourier transform {JT(uj k )} k=1 is asymptotically uncorrelated. Let 

T 

I 1 

n(t, r) = E(X t , X T ) = - ^] E( J T (u kl ) J T (u k2 )) exp(-itu kl + iru k2 ), 

fcl,fc2 = l 

where z is the complex conjugate of the complex variable z. From the above we observe if 
E( Jri^ki) JT{wk 2 )) — for k\ ^ k 2 or k\ ^ T—k 2 , then we have n(t, r) = K,(t—r) for < t, r < T—l. 
In other words, an uncorrelated discrete Fourier transform sequence implies that the original time 
series is second order stationary, up to lag T. 

Let us consider a simple example to show that even if the time series are independent, but not sta- 
tionary, then {Jr(^fc)} are not uncorrelated. Let us suppose X t = a t e t , where a t is a deterministic, 
time dependent function and {e t } are independent identically distributed random variables with 
E(£ t ) = and var(e t ) = 1. In this case, the covariance of the discrete Fourier transform at the 
canonical frequencies is 

1 T 

cov(J T (uj kl ), J T {uk 2 )) = — -= VV t 2 exp(it(cj fcl -u k2 )). 

2ttT 

From the above, it is clear that cov(J T (wfc 1 ), Jr(^fc 2 )) f° r some k-y ^ k 2 . If we suppose that a t 
is a sample from a smooth function a : [0, 1] — > R, that is a t = a(t/T), then by using the rescaling 
methods often used in nonparametric statistics we have 

cov(J T (u kl ), J T (co k2 )) = / a(u) 2 exp(i2nu(ki - k 2 ))du. 

Jo 

2.2 The test statistic 

The above observations lead us to the following test statistic. We note that, if the the time se- 
ries is second order stationary, then ~K(J T (u k )) = and \ax{J T {oj k )) — > f(co k ) as T — > oo, where 
/ : [0, 2tt] — > K. is the spectral density of the original time series {X t } (see Priestley (1981) and 
Brockwell and Davis (1987)). Therefore by standardising with a/ f(oo k ), under the null of station- 
arity, {Jr(oo k )/ a/ fiy^k)} is close to an uncorrelated, second order stationary sequence. Therefore 
to test for stationarity of {X t } we will test for randomness of the sequence {Jrij^k)/ a/ f(co k )}. The 
proposed test will be a type of Portmanteau test (see Chen and Deo (2004) for applications of the 
Portmanteau test in time series analysis). Of course, in reality the spectral density f(cu) is unknown, 
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therefore we will replace /(•) with the estimated spectral density function fr(-), where 

3 



X 



K : [—1/2, 1/2] — > R is a positive, continuous, symmetric kernel function which satisfies K(x)d. 
1 and K{x) 2 dx < oo and b is a bandwidth. 

We define the empirical covariance at lag r, which is complex valued, of the discrete Fourier trans- 
form as 

T 

c T {r) = -2^ ^ ^=, forl<r<T-l. (2) 

The proposed test statistic is based on or(r). In the technical report we show that if {X t } is a 
stationary time series and K(Xf) < oo then both the variance of the real and imaginary parts of 
v / Tcr(r) converge to 

1 + ^ C V r /4(Ai,-A 1 -o; r ,A 2 ) 

4W 7o v //(A 1 )/(A 1 + cu r )/(A 2 )/(A 2 + cu r ) 

as T -> oo, where w r = 2vrr/T, / 4 (Ai, A 2 , A 3 ) = (2tt)" 3 ^~ j2)j3= _ 00 cum(X , X jl5 X j2 , X h ) exp(z(Aiji+ 
A2J2 + A3J3)) is the fourth order cumulant spectra. Furthermore, under the null hypothesis of second 
order stationarity, we show in Lemma 3.1, that the empirical covariances ct{t) at different lags are 
asymptotically uncorrelated and cr(r) = o p (l). Therefore we define the test statistic 

\cT{r n )\ 2 



Tm 1 , J_ r2. r 2n fj(A| ■ — A| — u-V,, . — Xj) 



where \z\ 2 = zz and r n 7^ or T/2. For example, we can choose r n = n, and use m consecutive 
lags. We note, that unlike the classical Portmanteau tests, using covariances with a large lag is not 
problematic as the discrete Fourier transform is periodic. 

We derive the asymptotic distribution of the test statistic in Section 3, under the null hypothesis 
that {X t } statisfies the MA(oo) representation 

00 

X t = J2^t-j, (4) 

3=0 

where {e t } are independent, identically distributed random variables with E(e t ) = 0, E(e\?) = 1 and 
k 4 = cum 4 (e ( ). Under these assumptions we will show in Corollary 3.1, below, that Cr(r) = o p (l) 
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and T m converges in distribution to x 2m - Therefore we reject the null of second order stationarity at 
the ct% significance level if T m > xLi(l ~~ °0- We proved the above result under the assumption that 
the time series is stationary, linear and has absolutely summable covariances. But we believe this 
result is true even if the process is nonlinear, but stationary or has long memory but is stationary. 
The proof is beyond the scope of this paper. We need strong mixing condition to prove this general 
result. 

In the case of linearity, (3) has an interesting form. It can be shown that 



1 f w f 77 / 4 (Ai, — Ai — w r , A 2 ) 



zd\id\2 = «4 



1 

2^ Jo 



2?r 



exp(i(p(uj) — <f>(u + uj r ))duj 



2Wo Jo ^f(X l )f(X 1 + u r )f(\ 2 )f(X 2 + uj r ) 
where 

,, . $M(w) 1 ^ 

<P(uj) = arctan — — — — and A{uo) = . > w, expuu;?). 

Hence in the case of linearity, the test statistic is equivalent to 

T m = T± = (5 ) 

Morever, if m is small and small lags are used (ie. r n = n) then the test statistic can be approximated 
bjT m = TZZi |c T (r)| 2 /(l W2). 

Remark 2.1 (Estimation of the tri-spectra) We observe that the test statistic T m requires es- 
timates of the the parameter 

Kr = — r r w.-a.-^-a,) 

2Wo Jo ^/(A 1 )/(A 1 +^.)/(A 2 )/(A 2 + ^) 

Therefore to estimate the above parameter we require estimators of the tri-spectra and spectral 
density f\ and f respectively. Brillinger and Rosenblatt (1967) propose a consistent estimator of 
the tri-spectra fi{-), which we denote as /4,t( - )- Therefore an estimator of the above integral is 

r . 



2tt 



o Jo 



\j ?T(Xi)f T (\l + Wr)^(A 2 )^(A 2 + U r ) 



where /t(A 2 ) is defined in (1). Since k r>T is a consistent estimator of K r , replacing n r in the test 
statistic with k r> T, does not alter the asymptotic sampling distribution. 

Remark 2.2 (Practical issues) The asymptotic distribution under the null is derived under the 
assumptions that the spectral density of the time series {X t } is bounded away from zero. In prac- 
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tice, even if this assumption holds, the estimated spectral density fr(-) 'may be quite close to zero. 
Therefore, in this case, to prevent falsely rejecting the null, we suggest adding a small ridge to the 
spectral density estimator fr(') to bound it away from zero. 

2.3 The power of the test 

In Section 4 we obtain the asymptotic sampling properties of the test statistic T m , under the 
alternative of local stationarity. In order to understand what nonstationary behaviour the test 
statistic can detect and how to select the lag r in the test statistic, we will now outline some 
of the results in Section 4. Suppose that {X t } is a nonstationary time series, where in a small 
neighbourhood of t the observations are close to stationary and has the local spectral density 
f(^,uj). In Lemma 4.1 we show that Ct(t) B(r), where 

B{r) = L PIT^F /(tt ' A) *M-i^rr)dud\, (6) 

and that T m has asymptotically a type of non-central chi squared distribution where the noncen- 
trality parameters are given by {B(r n )}. Hence, the test statistic is more likely to reject the null, 
the further B(r n ), is from from zero. Studying the above we see that if the dynamics change slowly 
over time, then a small lag r n , should yield a large B(r). On the other hand, if there is a rapid 
change in the behaviour, a large r n , leads to a large B(r n ). Therefore, in this case, by using a large 
r n , we are more likely to reject the null. 

3 Sampling properties of the test statistic under the null 

We now derive the asymptotic distribution under the null of stationarity. We will use the following 
assumptions. 

Assumption 3.1 Let us suppose that {X t } satisfies (4)- 

Let A(uj) = (27T) 1 / 2 Yl'jLo i'j ex P(*i w ) and define the spectral density f(u) = \A(u)\ 2 . Assume 

(i) Y^jLo Ij'V'jl < 00 (noting that this implies sup w |/'(^)| < 00). 

(ii) E(|e t 16 |) < 00. 

(Hi) inf w f(u) > and inf w \$tA(cu)\ 2 > 0. 

(iv) Either (a) . \j 2 ipj\ < 00 or (b) the derivative of the spectrum f'(ou) is piecewise montone on 
the interval [0, 2n] (in other words /'(•) can be partitioned into a finite number of intervals 
which is either nonincreasing or nondecreasing). 
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We use Assumption 3 . 1 (i,ii) to show asymptotic normality if cr(r) (in fact Assumption 3. 1 (ii) is 
used to obtain the error when replacing cr(r), with ct(t), defined below). We use Assumption 
3.1(iv) to obtain the rate of decay of the Fourier coefficients of the function , 1 

To simplify the analysis of the test statistic T m we replace the denominator in the covariance cr(r) 
with its deterministic limit. To do this, we define the unobserved covariance 



and obtain the asymptotic distribution of cr(r). 

In the following lemma we show that the difference between y/T\or(r) — cr(r)\ is negligible. 

Theorem 3.1 Suppose Assumption 3.1 is statisfied and let Cr(r) and Ct(t) be defined as in (2) 
and (7) respectively. Then we have 

VT\c T {r) - Mr) I = O p ((6 + -j=) + + b^) (1 + ^L)) . (8) 

PROOF. See Appendix A. 2, Lemma A.7, equation (23). □ 

In the following lemma we derive the asymptotic variance of c-r(r), and show that ct(t) is asymp- 
totically uncorrelated at different lags r, and at the real and imaginary parts. 

Lemma 3.1 Suppose Assumption 3.1 holds. Then we have 

cov(\ / T3?c r (r 1 ),v / r3?c T (r 2 )) = cov(VT^5 T ( ri ) , VT^d T (r 2 )) 

oa) n ^ r 2 



x ~ 2 



1 exp(i0(w) - 0(cu + ^))duo 



2tt 



+ 0(1) ri = r 2 ' 



and for allr 1} r 2 G Z, cov(VT3c T (ri), y/T$tbr(r 2 )) = 0(±). 

PROOF. See Appendix A.3. □ 
We now show normality of dr(r), which we use to obtain the distribution of T m . 
Theorem 3.2 Suppose Assumption 3.1 holds. Then for fixed m we have 



7 



as m(b + -j=) + (^72 + b 2 T 1/2 ) YZLi (l^j + ^172) ~^ and T — > oo ; w/iere J 2m is identity 
matrix and tp(x) = |^ / Q 27r exp(i0(cj) — </>(cj + rr))du;| 2 . 

PROOF. See Appendix A.4. □ 
By using the above we are able to obtain the asymptotic distribution of T m . 

Corollary 3.1 Suppose Assumption 3.1 holds. Then for fixed m we have T m x\ m w ^ 
m ( b + vW) + (WV* + &2TV2 ) £n=i + ^) - 0, a S T ^ 00. 

PROOF. The result immediately follows from Theorem 3.2. □ 
Hence we have shown under the null, that the test statistics has asymptotically a y 2 distribution. 



4 Sampling properties of the test statistic under the alter- 
native of local stationarity 

It is useful to investigate the behaviour of the test statistic in the case that the null does not hold. 
If the covariance structure varies over time, then the limit of ct{t) will not be zero. This suggests 
that the test statistic will have a type of non-central x 2 distribution. However, in the case that 
time-varying covariance has no structure it is not clear what the limit of c"t(V) will be. For example, 
consider the simple example of a time-varying AR process, X t = a(t)X t _ 1 + e t , where {e t } are iid 
random variables. Without any structure on the AR coefficient, a(t), it is not clear what the spectral 
density estimator, fr('), defined in (1) should converge to. Hence it is not possible to obtain the 
limit of ct{t). On the other hand, let us suppose that a(t), varies slowly over time and a(t) is a 
sample from a function a : [0, 1] — ■> R, that is for some T, a(t) = a(^), and the time series satisfies 
X t> T = a(j;)X t _ ltT + e t , t — 1, . . . ,T. Now in this set up, as we let T — > 00, a(-) varies less and 
X t} T is observed on a finer grid, which in reality can never be realised. Hence, by supposing that 
a(-) varies slowly over time we have imposed some structure on the time-varying parameter and we 
are using an infill asymptotic argument (see Robinson (1989)). In this case, we will show, below, 
that /t(-) is an estimator of the integrated spectrum, this can also be viewed as the average of 
local spectrums. The model described above, is an example of a locally stationary linear time series 
considered in Dahlhaus (1997) and Dahlhaus and Polonik (2006). Therefore, following Dahlhaus 
and Polonik (2006), we define a locally stationary linear time series as 

00 

X t , T = J2^Aj)et-j, (10) 
3=0 
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where {e t } are iid random variables, E(e t ) = 0, var(e t ) = I. Therefore we will consider the behaviour 
of the test statistic under the alternative of local stationarity. 

In order for {X t ^} to be a locally stationary time series, we will assume that ipt,r{j) closely approx- 
imates the smooth function ipj(-). Hence the time- varying MA parameters {ipt,T(j)} vary slowly 
over time. It can be shown that in this case, {X tT } is a locally stationary time series because it 
can locally be approximed by a stationary time series. We will use the following assumptions. 

Assumption 4.1 Let us suppose that {X tt r} satisfies (10). Suppose, there exists a sequence of 
functions ipj{u), such that ipj{u) is Lipschitz continuous and \ipj(if) — ipt,T{j)\ < T~ 1 £(j)~ 1 , where 
is a positive monotonically decreasing function which satisfies ^ \i\ 2 (-(j)~ l < oo. Let 

f(u,u) = (2ny 1 \ Ej°lo^( M ) ex P(^)| 2 ( hence SU P«,^ I d2f d { X ] 1 < °°)- 

(i) sup tt |^-(u)| < Kj)- 1 and sup u |^| < Ki{f)^ (hence sup^ |^| < ooj. 
(n) E(|4 6 |)<oo. 

(Hi) Define the integrated spectral density f(u) = f(u,u)du, and assume that inf w f(u) > 0. 

(iv) Either (a) sup u 5^ - \ip"{u)\ < oo (hence sup uw | 9 1 < oo) or (b) A{u,uj) and dA< ^^ are 
piecewise monotone functions with respect to u. 

We will show in Lemma 4.1, below, that in the locally stationary case the spectral density estimator 
/t(-) defined in (1) estimates the integrated spectrum f(u), where f(cu) is defined in Assumption 
4.1(iii). Roughly speaking, one can consider the integrated spectrum as the average of the locally 
stationary spectrums. 

As in Section 3, it is difficult to directly obtain the distribution of cr(r). Instead we replace the 

random denominator with its deterministic limit (that is Jr{^k)/ \J fri^k) with Jri^k) / y/ f (^k)) , 
and define 



T VfMf(Uk+r) 

where /(•) is the integrated spectrum. The following result is the locally stationary analogue of 
Theorem 3.1. 

Theorem 4.1 Suppose Assumption 3.1 is statisfied, and let cr(r) andcrir) be defined as in (2) 
and (11) respectively. Then we have 
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PROOF. In Appendix A. 2, Lemma A.7, equation (24). □ 

From the lemma above we see that in order for the sampling properties of VTct(t") and VTcrir) 
to coincide, we require that T -1 / 2 << b « T -1 / 4 . 

We now obtain the mean and variance of ct{t) under the alternative hypothesis of local stationarity. 

Lemma 4.1 Suppose Assumption 4-1 are satisfied and let f(ou) and f(u,u) be the integrated and 
local spectrum (defined in Assumption 4-1) respectively. Then we have 

E(/ T H-/M) 2 = 0(6 2 + ^), (12) 



E(c T (r)) - [/M/( J +c , r)]1/2 (/ /(«,o;)exp(-2i7rr«)d«)da;, (13) 

as T — > oo ; and 

cov(^VTd T ( ri ),^VTd T (r 2 )) - cov^VT5 T ( ri ),%VT5 T (r 2 )) ^Z ( ^l r2 

coviQfVfcrinlQfy/fcrfa)) - 2%$ n , (14) 



b — > 0, oT — > oo as T — > oo ; where 



_ic r (i) ,r(2) , r (2) , r (3) x . nr logT ^ 

^T.ri,^ 4 I 1 T,n,r 2 ~ 1 T,n,r 2 ~ 1 T,r 2 ,n ~ 1 T,n,r 2 ) ~ J> 



v (i,2) zlcrW ^H 2 ) -H 2 ) _r( 3 ) ^ i rv logT ^ 



v (2,2) _i/ r (l) _ F (2) _ r (2) ,p(3) x p/ lQ g T X 

z - , T,ri,r 2 4V 1 T,ri,r 2 1 T,n,r 2 1 T,r 2 ,ri 1 T.n.rJ 1" ^ I j, ^ 



and n = l,2,3j are defined in Lemma A. 12 (in Appendix A. 3). 

PROOF. See Appendix A.3. □ 

We use the above to obtain the asymptotic distribution of T m under the alternative. First we recall 
that we estimated the standardisation of cr(r), K r , in Remark 2.1. It is worth noting that in the 
case that of local stationarity, k r>T is an estimator of 

1 [ 2 * f 2w ACAi.-A^^-Aa) 
2 Wo 7o v / /(Ai)/(Ai + A 2 )/(A 2 )/(A 2 + o; r ) 
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where /(•) is the integrated spectral density and and f^Xi, A 2 , A3) = J* fi{u, Ai, A 2 , X 3 )du, with 
U(u, Ai, A 2 , A 3 ) = -Ai - A 2 - A 3 ) lf i=1 A(u, uji). 

Theorem 4.2 Suppose Assumption 4-1 holds. Let 



E = 



v (l,l) v (l,2) 
v (2,l) v (2,2) 



where S^^, E^ r2 , E^^ are defined in Lemma 4-1 and E^ri,^ = ^T,n,r2- 
Furthermore define fi' = ($lB(ri), . . . , $t,B(r n ), ^^(ri), . . . , QfS(r n )), where 

1 /^ 1 ^ 



B ( r n) = 7T iff \ f f — , vn72 / /(«,w)exp(-2i7rr n «)dudc < ;. 

2* Jo [/M/(w + WrJ] 1/2 Jo 



T/ien we /iawe 



T 



„=1 ( X + 2^ 



.)' 



with + (-gfk/2 + b 2 T l l 2S ) Yln=i (iTT + TT72) ~^ ^ as T ^ ; where X 2m is a normally distributed 
random vector with X 2m = {X±, . . . , X m , Y±, . . . , X m )' and X 2m ~ Af(n, E). Note the small abuse 
of notation, when we say we mean that the distribution of random variable A converges to 

the distribution of random variable B. 

Remark 4.1 We observe that if the matrix E ; define in the theorem above, were the identity matrix, 
then the limiting distribution of T m , is a non-central x 2 m where the noncentrality parameter is 
determined by the limit ofcT(r n ) (for n = 1, . . . ,m). Hence the power of the test depends on the 
deviation of 

J Q 27r yf(u)f(J+u> — )p72 Jo fi 11 ^) exp(—2inru)duduj from zero, for each of the lags r n . We see that 
term depends on the Fourier coefficient a(r n ; oj) := f(u, uj) exp(—2inr n u)dudLd and the magnitude 
of a(r n ; a;) depends on whether the frequency of the nonstationary variation matches r n . 

However, in the case that X t ^ is s second order nonstationary time series, E will not be a diagonal 
time series, because there is correlation between the real and imaginary parts of cr(r) and also 
correlation at different lags r. Thus the limiting distribution of % n will not be a classical non- 
central x 2m ; due to the correlations in E. However, the conclusions discussed above still hold, 
namely the power of the test is determined, mainly, by the magnitude of mean vector \x. 
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5 Simulations 



We now consider a simulation study. We compare the results of the test statistic for both stationary 
and nonstationary time series. In each case, we replicate the time series 1000 times, and for each 
replication we do the test. We do the test for sample sizes T = 256 and T = 512. We do the test for 
to — 1, 5 and 10. The percentage of time the test statistic exceeds X2m(0-05), that is T m > xlmi^-^) 
is given in the tables, we also give plots of the empirical density of the test statistic. 

5.1 Stationary time series 

We first investigate the behaviour of the test statistic under the null hypothesis of stationarity. 

(i) Model 1: X t = 0.8X t ^i + e t , where {e t } are iid Gaussian random variables. We do the test 
for consecutive lags r = 1, . . . , to, the results can be found in the table below. A plot of the 
estimated finite sample density of the test statistic is given in Figure 1. 



T = 256 


TO = 1 


TO = 5 


TO = 10 


T = 512 


TO = 1 


TO = 5 


m = 104 


% reject 


6 


5 


8 


% reject 


5.3 


6.4 


6.5 



(ii) Model 2: X t — X t -i — 0.7X t _ 2 = £t + 0.3e t -i + 2£ t _ 3 , {e t } are Gaussian. We do the test 
for consecutive lags r = 1, . . . , to, the results can be found in the table below. A plot of the 
estimated finite sample density of the test statistic T w is given in Figure 1. 



T = 256 


m = 1 


m = 5 


TO = 10 


T = 512 


TO = 1 


TO = 5 


TO = 10 


% reject 


4.9 


5.9 


6.3 


% reject 


8.33 


4.67 


3.33 



We observe that under the null of stationarity, the percentage rejects, in the tables, and the plots of 
the empirical density in Figure 1 suggest that the ^-distribution approximates well the distribution 
of the test statistic T m . 

5.2 Nonstationary time series 

We now investigate the performance of the test statistic under different types of nonstationary 
behaviour. 

(i) Model 3: In this model there is a change point occuring in the later part of the time series: 
X t = 1.5X t -! - 0.75X t _! + e t for t = 1, . . . , 0.75T and X t = 0.8X t _i + e t for t = (0.75T + 
1), . . . , T. We do the test for consecutive lags r = 1, . . . , to, the results can be found in the 
table below. A plot of the estimated finite sample density of the test statistic T w is given in 
Figure 2. 
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T = 256 


m=l 


m=5 


m=10 


T = 512 


m=l 


m=5 


m=10 


% reject 


100 


100 


100 


% reject 


100 


100 


100 



(ii) Model 4: In this model the variance of the innovation changes smoothly over time: X t = 
0.8Xi_! + a t e t , where a t — (| + sin(|||) + 0.3cos(|||)), where {e t } are Gaussian. We do the 
test for consecutive lags r = 1, . . . , m, the results can be found in the table below. A plot of 
the estimated finite sample density of the test statistic T w is given in Figure 2. 



T=256 


m=l 


m=5 


m=10 


T=512 


m=l 


m=5 


m=10 


% reject 


6.6 


16.9 


26.6 


% reject 


99.9 


98 


94.6 



We mention, that for 1 < t < 256, function a t defined above, does not vary much, which 
explains why the rejection rate for T = 256 is relatively low. 

(iii) Model 5: In this model there is a change point, but the change is quite small: X t = 0.8X t _i+e t 
for t — 1, . . . , 0.5T and X t = 0.6X t _i + e t for t = (0.5T + 1), . . . , T, where {e t } are Gaussian. 
We do the test for consecutive lags r = 1, . . . , m, the results can be found in the table below. 



T=256 


m=l 


m=5 


m=10 


T=512 


m=l 


m=5 


m=10 


% reject 


52 


35.6 


25.2 


% reject 


82 


66 


52 



(iv) Model 6: In this model, the time series is independent with time-varying variance. Define the 
piecewise varying function a : [0, 1] — > 1R. 



f or u (z /[JL ±) [II 15) [16 17\ F18 19NT 
iui u kz 1[ 20 ? 20^' L20 ' 2Q>i L 20 ' 20/' 1-20' 20/J 

a(u) = { 2 foriiG {[8,g),[g,g),[» l]} 

for u G U0 \2- 1) \^ ^) \^ ^) \^ 
iui a c 2Q j, [ 20 , 2Q j, [ 20 , 2Q j, l 20 , 20 j, [ 20 , 2Q ;j 

and the time series X t ^ = a(^)e t , where {e t } are iid Gaussian random variables. A plot of 
a realisation of the time series and the function cr(-) is given in Figure 4. We do the test for 
consecutive lags r = 1, . . . , m, the results can be found in the table below. 



T=256 


m=l 


m=5 


m=10 


T=512 


m=l 


m=5 


m=10 


% reject 


39 


59 


79 


% reject 


62 


85 


99 



We observe that the rejection rate increases as the number of lags used in the test grows. We 
now investigate why. In Figure 5, we plot the rejection rate of T\ (hence one lag) at lags r = 
1, . . . , 120, we do this for both sample sizes T = 256 and T = 512 and we also plot the absolute 
values of the Fourier coefficients of the function cr(-) for r = 1,...,120 (ie. { | a r | } , where 

a ( u ) = J2 r a r ex P(~ iru), which are estimated with a r ,T = \ Ylt=i ^(sll) ex P( ~5i2 r ))- ^ e 
observe that with one lag (in the test statistic) the rejection rate is greatest at the frequencies 
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r that the Fourier coefficients are largest. This illustrates well the theory of the test statistic 
under the alternative. More precisely, for this model, the time-varying spectral density is 
approximately f(u,cu) = cr(u) 2 , and the noncentrality parameter is largest when the Fourier 
coefficient J* f(u, u) exp(—2i7rru)du = <j(u) 2 exp(—2iirru)du is largest. 

We observe that for various types of nonstationary behaviour the test has good power. Moreover, 
we observe that for many types of nonstationarity, by using a few small number of lags r, we are 
still able to reject the null hypothesis. 

6 Real data analysis 

To illustrate the test for second order stationarity we consider two real data examples. For both data 
sets we will use the test statistic T4 = Tj^t=i |ct(^)| 2 - We choice m — 4, because the simulations 
in the previous section show that most nonstationary behaviour appears to be captured in the first 
four lags. We recall that under the null 7^ asymptotically has a chi squared distribution with eight 
degrees of freedom. 

We first test for second order stationarity of the monthly southern oscillation index time se- 
ries observed between January 1950 to December 1987 (T = 453). The data can be found at 
http://www.stat.pitt.edu/stoffer/tsa2/, a plot is given in Figure 6. The test statistic gives 
the value T 4 = 2.66, which corresponds to a p-value of 0.95, hence there is no evidence to reject the 
null of second order stationarity. 

In our second data example we consider the daily British pound/US dollar exchange rate data ob- 
served between January 2000 to October 2009. This data was obtained from http : //f ederalreserve 
In order to ensure the existence of moments we tranformed the data and considered the square root 
of the absolute log differences, that is X t — | log Y^ 2 — log Y" t 2 _ 2 | 1//2 , where Y t is the exchange rate at 
time t. A plot of the transformed data is given in Figure 7. The test statistic based on the entire 
data set gave the value T 4 = 99.4, which corresponds to the p-value p ~ 0. Thus suggests there is 
evidence to reject the null of second order stationarity. To locate the regions of nonstationarity the 
data is partioned into segments of half length, quarter length and eighth length and the test was 
performed on each of these segments. The results are given in Table 1. Studying Table 1 and the 
p-values, there is evidence to suggest that for most periods there is no evidence to reject the null 
of stationarity. However, over the blocks June 2002 - November 2004, and August 2008 - October 
2009, the data seems to be second order nonstationary. This information can be used to fit a model 
with time-varying parameters to the data. 
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T = 99.4 p = 0.000 



T = 29.5 p = 0.000 



T = 78.43 p = 0.000 



T = 20.0 p = 0.010 



T = 23.9 p = 0.002 



T = 5.1 p = 0.752 



T = 51.4 p = 0.000 



T 

p = 



= 7.2 
0.520 



T 

p = 



= 15.5 
0.055 



T 

p = 



= 19.4 
0.013 



T = 18.2 
p = 0.020 



T 

p = 



= 5.5 
0.702 



T 

p = 



= 12.0 
0.151 



T 

p -- 



= 12.1 
0.145 



T 

p = 



= 17.9 
0.022 



Table 1: Test Statistics and p- value 



7 Conclusions 



In this article we have considered a test for second order stationarity. The test is based on the 
property that the DFT of a second order stationary time series is close to uncorrelated. The 
sampling properties of the test statistic under the null are derived under the assumption that the 
time series statisfies a short memory, MA(oo) representation. However, empirical evidence suggest 
that similar sampling properties also hold in the case of both stationary long memory and nonlinear 
time series too. For this general case, it may be possible to use some of the theory developed in 
Kokoszka and Mikosch (2000), however, this is beyond the scope of the current paper and is future 
work. 
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Density Plot of Test Statistics, T(m=1 0), for AR(1 ) with coeff O.e 



Density plot of Test Statistics, T{m=1 0) from Stationary ARMA(2,3) 




10 15 20 25 30 35 40 45 

T(m=10) 




Figure 1: The estimate density of the test statistic 7[ for Left: Stationary Model 1 and Right: 
stationary Model 2 (T = 512) 



Density Plot of T(m=10) for times series with change points 



Dc-nsly Plolo' :est sialics Tcn=10:. c ; AR D'ocsss nilh changing coefficients 
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Figure 2: Estimate density of test statistic for Left: Nonstationary Model 3 and Right: Nonsta- 
tionary Model 4 (right). (T = 512) 



Time Series Plot of AR(1), with coeff: 0.8 



Nor Staionary frnS Series Plot of AR(1] process with varying innovation 
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Figure 3: Left: Stationary realisation Model 1. Right: Nonstationary realisation Model 4. 
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Figure 4: Top: A realisation from nonstationary Model 6. Lower: The time varying function er(-) 

Rejection Rate for various r's tor N=256, and N=51 2 




50 60 70 
Fourier Frequencies 



Figure 5: Top: The x-axis is the lag used in the test statistic r = 1, . . . , 120 and the y-axis is the 
corresponding rejection rate using the test statistic T\ > x|(0.05) for T = 256 and T = 512. Lower: 
Absolute values of the Fourier coefficients of cr(-). 
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Monthly Southern Oscillation Index From 1950 to 1987 
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Figure 6: Monthly southern oscillation index time series between January 1950 to December 1987 



Time Series plot ot Transformed US-UK exchange rate from Jan 4 2000 to Oct 16 2009 




Figure 7: Daily British pound/US dollar exchange rate data observed between January 2000 to 
October 2009 



A Appendix 



In this appendix we prove the results from the main section. 

For short hand, when it is clear that T plays a role, we use the notation := Jj-^k)-, Jk — Jri—^k), 
fk ■= fri^k) and f k = f(u k ). 



A.l Some results on DFTs and Fourier coefficients 

In the sections below, under various assumptions of the dependence of {X t }, we will show asymptotic 
normality and obtain the mean and variance of cr(V)- m the case that {X t } is a short memory, 
stationary time series, then it is relatively straightforward to evaluate the variance of cr(r), since 
{ Jr(<^fc) Jri^k+r)} are close to uncorrelated random variables. However, in the nonstationary case 
this no longer holds and we have to use some results in Fourier analysis to derive var(<5r (?"))• To do 
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this we start by studying the general random variable 



I T 

H T = ■= ^2H(u k )J T (uj k )J T (uj k+r ), 



T 

k=l 



where Jxif^k) = ,/ 27rT ^2t=i Xt exp(-r^— ). We will show that under certain conditions on H(-), H T 



can be written as the weighted average of X t . Expanding J T {ui k )JT{oj k ) we see that Ht can be 
written as 

H T = X t X T exp(-iu r T) (^^2 H(u k ) exp(iu k (t - t)) V 

t,T ^ k=l ' 

Without any smoothness assumptions on H{uj k ), it is not clear whether the inner sum of the above 
converges to zero as \t — r| — > oo, and if the variance of H T converges to zero. However, let us 
suppose that sup w \H'{uj)\ < oo. In this case, the DFT of H(u), ^ Ylk=i H{uj k ) exp(iuj k (t — r)), is 
an approximation of the Fourier coefficient h(t — r) — J H(u) exp(i(t — r)uo)duj (the error in this 
approximation is discussed below). Noting that the Fourier coefficients h(k) — > as k — > oo, we 
have 

T T 

H T * f J2x t J2Ht- r)X T exp(-^r) := - £ X t Y t (r) . 

t=l T t=l 

Now under relatively weak conditions on the Fourier coefficients, {h(k)}, is almost surely finite 
(and second order stationary if {X t } were stationary). Hence H T can be considered a weighted 
average of {X t }, where the weights in this case are random but almost surely finite. We now justify 
some of the approximations discussed above and state some well know results in Fourier analysis. 
An interesting overview of results in Fourier analysis is given in Briggs and Henson (1997). 

The following theorem is well known, see for example, Briggs and Henson (1997), Theorem 6.2, for 
the proof. 

Theorem A.l Suppose that g[0,D] — > R is a periodic function with period Q. We shall assume 
that either (a) sup x |g"(:r)| < oo or (b) sup x |</(;e)| < 00 an d #'(') ^ s piecewise montone function. 
Let 

m 



a(s) = ^^ g(x) exp(isx)dx and a T (s) = ^ ^ g^jr) exp(i- 



Therefore for all s, we have in case (a) \a(s)\ < C svp x \g"(x)\s 2 and in case (b) \a(s)\ < 
C sup x \g'(x)\s~ 2 , where C is constant independent of s and g(-). 

Moreover sup 1<s<T \a(s) — ot(s)| < CT~ 2 . 
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We now apply the above result to our setup. We will use Lemma A.l, below, to prove the asymptotic 
normality result in Section A. 4. 

Lemma A.l Suppose Assumption 3.1 or Assumption 4-1 is satisfied. And let f(ou) be the spectral 
density of the stationary linear time series or the integrated spectral density of the locally stationary 
time series. Let 

G ^ (s) = ^g(/M/K + ^))v^P^> < 15 > 

1 f 2n 1 

andG ^ s) = ^J uWW^^ eMisu)dw - (16) 

Under the stated assumptions we have either (f(uj)f(uj + uj r ))~ l l 2 has a bounded second derivative 
(under Assumption S.l(iv-a) or Assumption 4-1) or that (f(u)f(u + uj r ))" 1 ^ 2 has a bounded first 
derivative and is piecewise monotone (under Assumption S.l(iv-b)). Therefore, sup Wr J2 S < 
oo and sup Wr Y^ =1 \G TlWr (s)\ < oo. 

PROOF. The above is a straightforward application of Theorem A.l. □ 
We will use the result below in Sections A. 2 and A. 3. 

Lemma A. 2 Suppose Assumption 4-1 is satisfied. Define the nth order cumulant spectra as 

j r n—1 x n— 1 

/ n (u,wi,...,w n _i) = , {n/2) _A YlMu,^) >A{u,-^2uj), 

3=1 3=1 

and the Fourier transform 

F n (k;u;i, . . . ,w n -i) = / f n (u,cj 1 ,...,u; n - 1 )exp(i2irku)du. (17) 

Jo 

(i) If Assumption 4-.l(iv)(a) holds, then sup Uja;i) n 1 9 f" ( - u ' lJ ^ 2 " ,uln - 1 ^ | < oo and 

sup \F n {k;u 1 ,...,u n - 1 )\<C sup | — \ — . (18) 



wi,...,a;„_i u,wi,...,w„ 



<9« 2 '|fc| 2 ' 



(mJ If Assumption 4-l(iv)(b) holds, then sup U 0JU ^ \ d M u ^^--^n-i) j < ^ gM^iM ^ a 
piecewise monotone function in u and 

,„ /, M^r. |9/„(u,wi,...,w n _i). 1 
sup |F n (fc;wi, ...,w n _i)| < C sup | 1 — . (19) 

u>i,...,w„ u,wi,...,a)„ C M 

VKe no£e £/ia£ t/ie constant C is independent of the function /„(•) and k. 
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PROOF. To ease notation we only prove (ii), the proof of (i) is similar. We note that 

(27r) Tu = ^-2^)!L du HMu,uj) + 

3=1 r=1 j¥=r 



dA(u,-Y.U«>i) n " 



du ~ »' 



3=1 

Now under Assumption 4.1, sup uw |j4(u, and sup ua; I dA jfa^ 1 are bounded function, hence we see 
from the above that sup u w | ^"("^"'""-1) | j s bounded. Moreover, by using Theorem A.l we have 
(18). The proof of (ii) is similar, and we omit the details. □ 

We observe that in the stationary case A(u, u) = A(u), then F n (k] u>i, . . . , 0J n -i) = for k 7^ 0. 

In the following lemma we consider the error in approximation of the DFT with the Fourier coeffi- 
cient. 

Lemma A. 3 Suppose Assumption is satisfied. Let F n (k;u>i, . . . ,uj n -\) be defined as in (17) and 
let 

1 T t 

F n>T (s; u u . . . , u n ) = — ^2 fn(7p> • • • , Wn-i) exp(i2snt/T) . (20) 

t=i 

Then under Assumption 4-l(v)(a) we have 

\t? I \ t? i \\ s n ,9 2 /n(M,^i,...,^ n _i) 1 

sup |F n>T (s;wi,...,w n ) -F n (s;o;i,...,a; n _i)| < C sup | — 1 — , 

wi,...,w n _i u,oji,...,ui„ OU 1 

and under Assumption 4-l(v)(b) we have 

sup |F njT (s;o;i, . . . - F n (s;o;i, . . . ,o; n _i)| < C sup | 1 — , 

wi,...,a;„_i u,wi,...,u„ Cm -t 

where C is a constant independent of /«(•)■ 

PROOF. The proof follows immediately from Theorem A.l and Lemma A. 2. □ 
A. 2 Proof of Theorems 3.1 and 4.1 

The follow result is due to Paparoditis (2009), Lemma 6.2, and is a generalisation of Brillinger 
(1981), Theorem 4.3.2, for locally stationary time series. 
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Lemma A. 4 (Paparoditis (2009), Lemma 6.2) Suppose that Assumption 4- 1 holds and let /„(•) 
be defined as in (17). Then for ui, . . . , cu n e [0, 2n] we have 

HI \ i t \\ {^) in/2) - l s^ f f t ^ ^ (logT)-\ 

t=i j=i 

Now the following corollary immediately follows from Lemmas A. 3 and A. 4. 

Corollary A.l Suppose that Assumption 4-1 holds, and let F n (-) be defined as in (17). Then we 
have 

( 27r )(n/2)-l (logT)™- 1 1 

cum{ Jt(^J, . . • , Jr(wj„)) = T(ra/2 )-i F »0'i + • • • + j n ; Wji, . . . , ^ n _J + 0( — ^ — + — ) 
and 

1 CflogT)™" 1 C 
| c«m( JrK), • • • , JrK„))| < C ^pr^ + ; ; ; + -j + ^ + ^ , 

where Uj r = 2nj r /T, —T < j r <T and C is constant independent of u. 
The following well known result represents moments in terms of cumulants. 
Lemma A. 5 Let us suppose that sup t K(\X t \ n ) < oo. Then we have 

E(x tl ,. .., x tn ) = j2 n cum ^ teB ^ ( 2i ) 

7r Beir 

where n is a partition of{t\, t2, ■ ■ ■ , t n } and the sum J2 n ^ s done over all partitions of {ti, t2, ■ ■ ■ , t n }. 
We use the above lemmas below. 

Lemma A. 6 Suppose either Assumption 3.1 or Assumption 4-1 holds. Let /(•) be the integrated 
spectral density (or the true spectral density in the case of stationarity). Then we have 



^(^Jk 1 Jk 1 +rJk 2 Jk2+r{fk 1 fki+r — E{.fk 1 fk 1 +r)}{fk 2 fk 2 +r ~ E(/fc 2 fk 2 +r) }^ — ^ j^ _ 



k 2 \ 2 T 



(it) 

C 



E< fkfk+r — E(fkfk+r) f < 



(«T) S 
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(Hi) Suppose Assumption 3.1(i) holds, then we have E{fkfk+ r } — fkfk+r = 0{b + ^). Suppose 
Assumption S.l(iv-a) or Assumption 4- 1 holds, then we haveE{fkfk+ r } — fkfk+r = 0(b 2 + ^). 

(iv) Suppose Assumption 4-1 holds, then we have 

T| Sl + S2 + S3 + s 4 | 2 ^ T 2 T 2 '' 
w here 5 T = 0C-^ + ±). 

PROOF. To simplify notation in the proof we let Wj = ^K(^-). By expanding the expectation in 
(i) we have 



E^Jfei Jk 1 +rJk 2 Jk 2 +r{fk 1 fk 1 +r ~ ^{fkifki+r)}{fk 2 fk 2 +r ~ ^(fk 2 fk 2 +r) } 

E 



w h w j2 w h w jA 



^{Jki—ji Jki—ji Jki+r—j-2 Jk\+r—j 2 '^k\ J k\+r J k 2 Jk 2 +rJk 2 —jz Jk 2 —jz J k 2 +r—j^J k 2 +r—j^) 

~ E ( Jki - j 1 Jki - j 1 Jki +r-j 2 Jki +r-j 2 )~^>(Jk 1 -Jk 1 +r Jk 2 Jk 2 +r Jk 2 -j- A Jk 2 —j 3 Jk 2 +r-j 4 Jk 2 +r-j 4 ) 
E(t//j 2 _j 3 Jk 2 —jz 'Jk 2 +r—j4 •Jk 2 +r—j4)^'i.'Jki Jk\+rJk 2 Jk 2 +rJki—j\ Jk\—j\ Jk\+r~j 2 Jk\+r—j 2 ) 

+E( t /fc 1 _j 3 J j :i Jki+r— j 4 Jki+r— jij^yJki Jk\+rJk 2 Jk 2 +r)^'\Jk 2 —j3 J k 2 —jz J k 2 +r—j4,Jk 2 +r—ji) f -(22) 



To prove the result we first represent the above moments in terms of cumulants using Lemma A. 5. 
We observe that many of the terms will cancel, however those that do remain will involve at least 
one cumulant which has elements belong to the set {Jk 1 -j 1 , J^-jn Jk!+r-j 2 , J^+r- j 2 } and the set 
{ Jk 2 -h-> Jk 2 -j 3 , Jk 2 +r-j 4 , Jk 2 +r-j 4 }, since these can only arise the cumulant expansion of 
E( Jk\ -j 1 Jk-i - ji Jk 1 +r-j 2 Jki +r-j 2 Jk 1 Jki+rJk 2 Jk 2 +rJk 2 -j- A Jk 2 -j 3 Jk 2 +r-j A Jk 2 +r-u)- Now by a careful anal- 
ysis of all cumulants involving elements from both these two sets we observe that the largest cumu- 
lant terms are cum(J fcl _j 1 , Jk 2 -j 3 ) and cum( J^-jn Jk 2 -j 3 ) (the rest are of a lower order). Therefore 
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recalling that ^ J|i: , J;i(; w h w h w h w jA = E^ 2 j 3 j4=i py H K< ^b) and usin S Corollary A.l gives 

^(jki Jki+rJk2 Jk 2 +r{fkifki+r ~ ^(fk 1 fk 1 +r)}{fk 2 fk 2 +r ~ ^(fk 2 fk 2 +r)} 
bT 



< v 1 T\rc( Uji M - | lQ g r y 

(6T) 4ll ^ 6 ^Ifci + A^-ji-jal 2 T> 



31,32,33,. 



C / 1 logT 

< 7777 77 7"^7 + 



6TV|A;i-A; 2 | 2 T 
where C is a finite constant independent of k± and &2- 

The proof of (ii) is similar to the proof of (i), hence we omit the details. We note that if we were 
to show asymptotic normality of fkfk+r: then (ii) would immediately follow from this. 

We now prove (iii). By definition of fkfk+r, using Lemmas A. 4 and A.l, under Assumption 3.1 (i) 
we have 

^{fkfk+r} - fkfk+r = ^2 (tfpyK{^)K{^)^{Jk-hJk-hJk+r-j 2 Jk+r-j2} ~ fi^k) fX^k+r) 

31,32 

jl 32 

Using a similar proof we can show that under Assumption 3.1(iv-a) or Assumption 4.1 we have 
^{fkfk+r} - fkfk+r = 0(b 2 + £jO. Thus proving (iii). 

The proof of (iv) uses Lemma A. 4 and Corollary A.l and is straightforward, hence we omit the 
details. □ 

In the lemma below we prove Theorems 3.1 and 4.1. 
Lemma A. 7 Suppose Assumption 3.1 holds. Then we have 

_ i i \ 

(23) 

Ul Ul '< - I'/" i i-i- J 

Suppose Assumption holds. Then we have 

1 1 1 \ 

(24) 



VT |c T (r) - Mr) I = O + (6 + ^) + + & 2 T'/*) ( 1. + 



VT,e T(r ) _ &(r) | = (_L + (_J_ + fft^ (J_ + _L)^. 

PROOF. The proofs of (23) and (24) are very similar. Most of the time we will be obtaining the 
bounds under Assumption 4.1, however in a few places the bounds under Assumption 3.1 can be 
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better than those under Assumption 4.1. In this case we will obtain the bounds under each of the 
Assumptions (separately). 

To prove both (23) and (24) we first note that by the mean value theorem evaluated to the second 
order we have x~ 1 ' 2 -y~ 1/2 = (-l/2)x-V 2 (x - y) + (1/2) (-1/2) (-3/2)% 5/2 (x - y) 2 , where x y lies 
between x and y. Applying this to the difference cr(r) — cr(r) we have the expansion 

Vf\c T (r)-Mr)\<ll+llI, 

where 

1 = if £ jj^w [hh+r " fkfk+r} and 11 = if ? iut^ {fJk+r " fkfk+r}2 - 

We consider the terms I and II separately. We first obtain a bound for E|7 2 |. Observe that 
E|/ 2 | = 3(E|/ 2 | + E|/f|), where 

h = j= E (/ ^ r + ; 3 / 2 If"}-* ~ Hfkfk+r)} 

h = (/ ^> {E(/fe//£+r) " fkh+rl 

hence we will obtain the bounds E(/ 2 ) and E(7|). Expanding E(/ 2 ) and using Lemma A.6(i) and 
that f^fki+r is bounded away from zero gives 

Expanding E|7|| gives 

C x 1 

e(/|) < rgMw^ E(44+r44+r)x 



(26) 



The bounds for E|/|| differ slightly, depending on the assumption. Under Assumption 3.1, by using 
Brillinger (1981), Theorem 4.3.2, it can be shown that E(Jk 1 Jk 1 + r Jk 2 Jk 2 +r) — 0(T _1 ) (since r ^ 0). 
Moreover, by using Lemma A.6(iii) we have E{/ fcl / fcl+r } — f^f^+r — 0(b + (bT)~ r ). Therefore 
under Assumption 3.1 we have 

E(/ 2 ) = 0((6+^) 2 ). (27) 
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Therefore, under Assumption 3.1, using (25) and (27) gives E|/| 2 = + (b + ^) 2 ) and 

I = > { " + w + 7Br y (28) 

On the other hand, under Assumption 4.1 we do not have that E( J^+r Jk 2 Jk 2 +r) = 0(T~ V ), 
instead we substitute Lemma A.6(iv) into (26) and obtain 

mi) = oa^ + iw+^f). (29) 

Therefore, under Assumption 4.1, using (25) and (29) gives E|J| 2 = + (^ + l)(b 2 + ^-) 2 ) and 



(30) 



We now obtain a bound for II. Since the spectral density f{uj) is bounded away from zero and 
sup w |/t(^) — f(w)\ — > (see (Paparoditis, 2009), Lemma 6.1(H)), we have 77 = O p (II), where 

II — {JkJk+r{fkfk+r — fkfk+r} )• 

^ T k 

To obtain a bound for 77 we use that \II\ < 31 1 \ + 31 1 2, where 

Hi — (\JkJk+r\{fkfk+r — ^(fkfk+r)} ) an d Ih = ~ 7= (| JkJk+r\ {^(fkfk+r) ~ fkfk+r} )• 

^ T k k 
Using Cauchy-Schwarz inequality, Lemma A.6(ii,iv) we have 

W~h\ < ^^E(|^J fc+ .| 2 ) 1/2 (lE{AA.^ -E(AA +r .)} 4 ) 1/2 = ^(^^(t^ + ^tt^))- 

We now obtain a bound for E|77 2 |- Using Lemma A.6(iii,iv) we have 

k ' ' 

Therefore 

= o((± + 5^)^ + W + ^ + T-^) . (31) 

Hence (31) and (28) give (23) and (31) and (30) give (24). □ 
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A. 3 The variance and expectation of the covariance cy(r) 
A. 3.1 Under the null 

It is straightforward to show, under Assumption 3.1, that E(\/Tct(^)) = 0(T -1 / 2 ). We now obtain 
the asymptotic variance of the or(r) under the null of stationarity. We mention that the following 
two lemmas apply to nonlinear time series too. 

The following lemma immediately follows from (Brillinger, 1981), Theorem 4.3.2. We use this result 
to obtain the asymptotic variance of Cy(r), below. 

Lemma A. 8 Let {X t } be a stationary time series where we denote the second and fourth order 
cumulants as k 2 and k 4 . Suppose ^2 k (l + \k\)\K 2 (k)\ < oo and ^2 kl k2 fc3 (l + |&j|)|K4(&;i, k 2 , k^)\ < oo. 
Then we have 

cov(J kl J k2 ,J k3 J k4 ) = (Z^f^-^+OfW 

V t=i / V t=i ' 

+ (^ E + 0(1)) E e*»— +0(1)) 

T 

+(270/4(0^,0^, -o; fe3 )- + (±). (32) 



(33) 



Lemma A. 9 Suppose the assumptions in Lemma A. 8 hold. Then we have 

cov{VTW T {ri),VTW T {r2)) = cov(VT^c T ( ri ) , VT^c T (r 2 )) 

0{T~ l ) n ^ r 2 

1 + Yl lM =i ^L&AKi, -w fcl+r , -Wfe,) + n = r 2 = r 

Furthermore if the tri-spectra f4(u>i, cu 2 , W3) is Lipschitz continuous we have 

cou(v y 7^o r (ri),v y 7fe.(r 2 )) = cou(v^^(ri), VT3&r(r 2 )) 

0{T- 1 ) n ^ r 2 

1, — C<Ji— t^r, — CJ2) 

y/ /(wi)/(wi+o; r )/(a)2)/(w2+Wr) 

and for allr\,r 2 , cov(\^^CT(ri), \/T^scT(r 2 )) = O(rp). 

PROOF. To prove the result we use that 3?c T (r) = I(c^(r)+c^(r)) and 3c T (r) = ^{c T {r) +c T {r)), 
and evaluate cov (v^CT^v^Kr,)), cov( v / T5T(ri),v / TgT(r 2 )) and ^(v^t^i), V*for(r 2 )). 



27 



Expanding cov (\^T c T (r i), y/Tc T (r 2 )) gives 

1 T 

cov(Vfc T (r 1 ),Vfc T (r 2 )) = - ^ 9tm9tm cov ( J ki Jk 1+ri , J k2 Jk 2 +r 2 ), 



ki,k 2 =i 

where = f(ujk)^ 1 ^ 2 f(ujk + o-v) -1 / 2 . Substituting (32) into the above it is easy to show that for 
r\ 7^ r 2 we have cov(v / Tc T (ri), V^Tor^)) = 0{T~ l ) and for r :— r± — r 2 we have 



cov (Vfc T (r),VTc T (r)) 

2 T 1 T 1 

k=l ki,k 2 =l 

2tt T 1 
2 + T* 9^9x1^^, -^ kl +r,-uJk 2 ) + 0(-). 

ki,k 2 =l 



The same method gives us a similar bound for cov(v / Tcr(r), yTbr{r)). Similarly it can be shown 
that unless r 2 = T — r\ we have cov(a/Tct(^i), VTcrfa)) = 0(T _1 ). Also, for r\ ^ r 2 we have 
cov( v / Tc T (r 1 ), v / Tc T (r 2 )) = 0(T~ r ) Altogether this gives us (33). 

We now obtain (34) by using (33) and replacing the sum with an integral. By using that the spectral 
density and tri-spectra /(•) and / 4 (-) are Lipschitz continuous we can replace the summand above 
with an integral to give 

oov(VT6r(r),Vr6r(r)) = 2 + — T T -= ~ ^ d^du* + Q(h. 

2vr Jo Jo y/fMffa+UrjfMffa + Wr) T 



A similar result can be obtained for cov(v / Tc r (r), \/Tc T (r)). Substituting the above into (33) gives 
(34). □ 

Lemma A. 10 Suppose Assumption 3.1 holds. Then the spectral density f(cu) and the phase func- 
tion 4>{oj), satisfy sup w < 00 an d su Po; 10' 0^)1 < °°- 

PROOF. The fact that sup w \f'(uj)\ < oo follows immediately from Assumption 3.1 (i) . To prove 
that sup w |0'( w )l < oo we recall that (f>(u) = arctan ■ Differentiating (f)(cu) with respect to cu 
and using the chain rule gives 



du du> 



Under Assumption 3.1(i,iii) we have inf w 3lA(u) > 0, sup w |A(c<j)| < oo and sup^ | dA ^ \ < oo. 
Therefore sup w < oo. Thus giving the required result. □ 
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PROOF of Lemma 3.1 We use Lemma A. 9 to prove the result. We note that Assumption 3.1 
satisfy the assumptions in Lemma A. 9. Therefore we have the identity in (33). However in the case 
that the time series is linear this expression can be simplified. It is well know that for a linear time 
series the tri-spectra can be written in terms of the transfer function A{uj) that is 

/ 4 (Wl,W2,W 3 ) = — A{u 1 )A{u)- i )A{u) Z )A{-U) 1 -L0 2 -U 3 ). 



Now we recall that for a linear time series A{uo) = a/ f{u) exp(i<j)(u))) hence substituting this into 
the ratio in (33) gives 



2tt 



Substituting the above into (33) gives 



= /t 4 exp (i[(p(uj kl ) - <f>(u kl + uj r ) - (p(uj k2 ) + (p(u k2 + u r )]). 



cov(c T (r), c T (r)) = 1 + y 



^^exp(i[0 fe -<t) k +r\) 



Finally we note that due to Lemma A. 10, the phase </>(•) is Lipschitz continuous, hence exp(i0(o>)) 
is Lipschitz continuous, and we can replace the summand in the above with an integral to give 



cov(c T (r), c T {r)) = o 2 + — 



1 f 2lT 2nr 



+ 0{-). 



This completes the proof. 



□ 



A. 3. 2 The alternative of local stationarity 

We now consider some of the moment properties of or{r) under the assumption of local stationarity. 
Lemma A. 11 Suppose Assumption 4-1 holds. Then we have 



cov(J kl J k2 , Jk 3 Jk 4/ 



+ 



{F 2 (k x - k 3 ;u kl )F 2 (k 2 - h;u k2 ) + F 2 (h - h;u kl )F 2 (k 2 - k 3 ;u k , 2 )} 

(27T) 



T 



-F A (ki + k 2 - k 3 - k 3 ; u kl , u k2 , -u ks 



+ + ^ (F 2 (h - h; co kl ) + F 2 (k 2 - k 4 ; u k2 ) + 

F 2 (h - h;u kl ) + F 2 (k 2 - k 3 ;u k2 ))), 



(35) 



where {F 2 (-;u)} and {F 4 (-; u> kl , cu k2 , ou k3 )} are defined in (20). 
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PROOF. Expanding cov(Jk 1 Jk 2 , Jk 3 Jk 4 ) in terms of cumulants gives 

cov(J kl J k2 , J k3 J k4 ) = cov(J fcl , J fe3 )cov(J fc2 , J k4 ) + cov(J fcl , J fc4 )cov(J fe2 , J fc3 ) + cum(J fcl , J fc2 , J fc3 , J fe4 ), 

finally substituting Corollary A.f into the above gives the result. □ 

PROOF of Lemma 4.1 equations (12) and (13) We first prove (12). We note that from the 
definition of frij^k) in (1) and under Assumption 4.1 we have 

|E(/ T K)) - fM\ = | ^Ki^^MlM^l 2 ) - /K)| = 0(6 2 ). (36) 

3 

We now obtain var(/r(u;fc)). We observe that 

var(/ T K)) = Y,jj^KP^)KP^ 



31,32 



Now we substitute (35) into the above to give 



var(/ r K)) < C£^X(^^ )X( ^ 

31,32 

\F 2 (j2-Ji,iO n )F 2 (j 1 -j 2 ;-LU n )\ + ^F 4 (0;^ 1 ,-^ 1 ,^ 2 )} + O( ^|P ). 

We observe from the above that the covariance terms dominate the fourth order cumulant term. 
Moreover, by using Lemma A. 2 we have sup w \F 2 (s; u)\ < oo, which gives var(/r(a;fc)) = O(^). 
This together with (36) gives (12). 

We now prove (13). Using Lemma A. 4 for n = 2 gives 

Now by using replacing sum with integral and using Lemma A. 3 (noting 

F 2>T (-r; cj fc ) = J X;?Li /(|, Wfc) exp(-itcu r ) and F 2 (-r; w fc ) = /J /(«, w fe ) exp(-i27rrw)dw) gives 

E(£r(r)) = ^ f r ETwfeb^ expt-aw)^ + 0(^1 + i,). (38) 

thus we have (13). □ 

To prove (14) in Lemma 4.1, we evaluate the limiting variance of c T (t) under the alternative of local 
stationarity. 
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Lemma A. 12 Suppose Assumption 4-1 holds. Then we have 

cov(nVfc T ( ri ), KVr~ CT (r 2 )) = \ (rv ntn + v% ur2 + r% ntri + r^J + o(^), (39) 

cov^Vfcrin), ZVf 6r(r 2 )) = ^(4^ + 1^ - 1^ - T^J + O(^), 

cov^VTcrin), zVT~c T (r 2 )) = \(r^ r2 - rf^ - r<£, >ri + r^J + o(^), 

where 

^.ra = ^ ^rjfe^SW* 1 - h; uj kl )F 2 (-k 1 -r ± + k 2 + r 2 ; -u kl+ri ) + 
fci,fc 2 

F 2 (fci + fc 2 + r 2 ; uJ kl )F 2 (-k! - ri - £: 2 ; -u kl+ri )} + 9tm9tm f ^ ~ r u w *u -Wfci+nrWfe), 

fel,fe2 

r T 2) n,r 2 = ^ dri^TMiMki + k 2 ] u kl )F 2 (-k 1 — T\ — k 2 — r 2 ; -w fcl+ri ) + 

F 2 (fci - fc 2 - r 2 ; w fcl )F 2 (-A;i - n + £: 2 ; -w fcl+ri )} + ^f,fe 1 fl , S i7 4(-r 2 - r x ; , -w fcl+ri , -w^), 

fci,fc 2 

r T 3) r l5 r 2 = ^ 9TM9TM{ F ^~ kl + ^ -^kJ-^fci + 7"i - fc 2 - r 2 ; w fcl+ri ) + 
fcl,fc 2 

F 2 (-A; 1 -k 2 - r 2 ; -a; fcl )F 2 (A; 1 + n + £: 2 ; w fcl+ri )} + 9tm9tm F ^ ~ r ^ ~uJ kl ,uj kl+ri , -u k2 ), 

kito 

and the coefficients F 2 {-) and F 4 (-) are defined in (20) and = {f(u; k )f(u} k+r )} ^ 2 . 

PROOF. To prove (39) we use $tcr(r) = |(cr(r) + cr(V)) and ^sct(t) = -y(cr(r) +cr(r)), and 
cov(v^Tcj'(r 1 ), V^Cr( r 2)) and cov(v^Tct(Vi), \/Tc T {r 2 )). Expanding cov(\/Tc T {ri), V^Cy(r 2 )) we 
have 

cov(VTc T (ri), v / Tc T (r 2 )) = - ^ g { ^lg { ^ k \cov{J k J kl+ri ,J k J k2+r2 ), 
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now by substituting (35) into the above we obtain 

cov(Vf~c T ( ri ), VTc T (r 2 )) = r« iira + O(^). 

Similar results can be obtained for cov (VT5 T ( ri ),VTd T (r 2 )) and cov(VTd T ( ri ) , VTd T (r 2 )). Using 
this we obtain the required result. □ 

PROOF of Lemma 4.1, equation (14) This immediately follows from Lemma A. 12. □ 



A. 4 Asymptotic normality 

In this section we prove asymptotic normality of \fTcr{r). Since the locally stationary linear 
time series model includes the stationary time series model as a special case we show asymptotic 
normality of the more general locally stationary model. We start by approximating y/Tdrir) with 
a random variable which only involves current innovations {e t }J = i- We make this approximation 
in order to use the martingale central limit theorem to prove asymptotic normality of VTcrir). In 
this section, we will make frequent appeals to Lemma A.l. 

Using that the locally stationary time series model X t:T satisfies (10) we have can write VTc T (r) 
as 

\ZTc T {r) 

j T T oo 

k=1 J \ k) J \ k ~r r) f T=l j lt j 2 =0 
^ T oo 

= 7pi/2 Yl ~ r ) ex PH™r) Yl ArUMrAh) {zt-h e r-n ~ H^t-ner-n)), 

t,T=l 31,32=0 

where {G T ,u r {s)} is the DFT defined in (15). 

We now partition y/Tcr(r) into terms which involve 'present' and 'past' innovation, that is 

Vf(c T (r) - E(c T (r)) = VT(d T (r) + V T (r)) , (40) 

where 

1 T 

d T {r) = - ^ G T>U!r (t - t) exp(-iruj r ) iptAh^rAh) {e t - h e T _ j2 - E(£ t _ il £ T _ j2 )) 

t,r=i o<n<t-i o<j 2 <r-i 

1 T 

V T (r) = — G T>U!r (t - r) exp(-imv) AAjJAAh) {zt-n e r-h ~ E (^-ji^r-j 2 )) • 



T 

t,r=l ji>t— 1 or 32>t— 1 
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In the following lemma we obtain a bound for the remainder Vr(r). Later we will show asymptotic 
normality of dx(r). 

Lemma A. 13 Suppose Assumption 4-1 hold. Then we have 

(T l ^\V T (r)\ 2 ) 112 <CT- 1 ' 2 , 

for some finite constant C . 

PROOF. We first observe that VTV T (r) = I± + I 2 + h, where 
1 T 

Jl = GT,u, r (t - r) exp(-iru r ) Y Y AT(ji)^T,T(h)^t-he T -j 2 - E(e t -j 1 e T _j 2 )) 

t,T=l 31>t-l 0<j' 2 <t-1 



1 T 

/2 = fi/2 Yl GT,u> r (t - r) exp(-iru r ) Y Y ^tAh^rAh) (st- h E T - h - E(e t _ jl e T _ j2 )) 



t,T=l 



J2>T-1 0<Jl<t-l 



J 3 = ^ E G TiWr (t-r)exp(-i™ r ) ^ ^ i/>t,Ttii)A,Tti2){£t-3i£T-j 2 -E(e t _ jl e T _ ja )). 

t,T=l J2>T-Ijl>t-1 

We first show that E(7 2 ) 1/2 = 0(T~ 1/2 ). By the Minkowski's inequality we have 

E(/ 1 2 ) 1/2 <^Y |<Wt - ^ijlE^ £ £ A,T{3Mr,T{j2)(et- h e T - h - E( £t _ jl£r _ j2 ))] 

t,T=l ^ ^ 7"i>*— 1 0<72<r-l ' 



It can be shown that 



E 



( E E ^t,T{h)^r,T{h)^t-he T - j2 - E(e t _ jl e T _ j2 )) J 

^ 7l>*— 1 0<72<T-1 ' 



'ii><-10<i 2 <r 

< (var(e 4 ) 2 + var(e 2 )) 



Y\^Ah)\ 2 Y\^Ah)f 

jl>t J L J2=0 



Substituting this into the bound for E(/ 2 ), under Assumption 4.1 and using Lemma A.l we have 



WIY 12 < sup 



^sup 



- oo 

£liMj«)| 2 

Lj 2 =o 


1/2 


- T -| 

- 8 = 1 


- oo 


1/2 




£llMj,)| 2 

^2=0 




- 8 = 1 



t=l 

T 



1/2 



t=l ji>t 
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Using a similar method we can show that E(/ 2 2 ) 1 / 2 = 0(T -1 / 2 ) and E(if) 1 / 2 = 0(T -1 / 2 ). Thus we 
obtain the result. □ 



Therefore the above lemma shows that VT(c T (r) - E(c T (r)) = VTd T (r) + O p (T 1/2 ). 

Remark A.l Now it is worth noting that in the case that {X t } is a stationary linear time series, 
then dr{r) has an interesting form. That is, it is straightforward to show (using (Priestley, 1981), 
Theorem 6.2.1) that 

T 

I 1 

VTc T (r) = — V* J £ (u k )J e (uj k+r ) exp (i((f)(uj k ) - <p(uj k+r ))) + O p (T~ 1/2 ), 
^ T k=i 

where J £ {uj) = (27rT) -1 / 2 J2t=i £ t^p(^ UJ k)- 

We use the martingale central limit theorem to show asymptotic normality of \fTdrir), which will 
imply asymptotic normality of \/T(c T {r) — E(cr(r)). To do this we rewrite Vrdxir) as the sum of 
martingale differences 

Vfd T (r) 
1 T 

= T^n S ( £ 51 £ S2 - E(£ S1 £ S2 )) Yl G T<vr(t - t) exp(-iTu r )ip tt T(t - si)^ T)T (r - s 2 ) 

Sl,S 2 = l S!<t<Ts 2 <T<T 

j T s-1 

= ™S Mt ( s ) where Mr(s) = (e2-l)A T (s,s)+e a ^e Sl (A T (si,s)+A r (si,s)) 



TV2 

s=l si=l 



and 



4r(si,s 2 )= 5^ S Gt .^(* ~ r ) ex P(-^™r)^t,T(t - si)^ T>T (r - s 2 ). 

si<t<Ts 2 <r<T 

We now show that the coefficients in the martingale differences are absolutely summable. 
Lemma A. 14 Suppose Assumption 4-1 holds. Then we have 

s-1 



sup ^2(\A T (s, si)| + \A T (s 1} s)\) < oo. 



T 

Sl=l 
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PROOF. To prove the result we note that under Assumption 4.1 and using Lemma A.l we have 



s-l s-l 

52\A T ( ai ,8)\ < E E E \G T ,. r (t - r)\ ■ \^, T (t - ■ \Mr - *) 

si=l s 1= l s 1 <t<T s<r<T 



< 



Ei<w*)i sup ei^oo 2 , 



which gives the required result. 



□ 



In the theorem below we show asymptotic normality of dx{r). To accommodate both the stationary 
and nonstationary case we will let the asymptotic variance of dr(r) be V r and specify its value later. 

Theorem A. 2 Suppose Assumption 4-1 holds. Furthermore suppose that var(Vfd T (r), Vfd T (r)) - 
V r < oo as T — > oo. Then we have 



PROOF. We use the martingale central limit theorem to prove the result. We will show asymptotic 
normality of $ty/Tdr(r). However, using the same method it straightforward to show asymp- 
totic normality for all linear combinations of 3?V^^T( r ) and ^s\^Td T (r) and thus by the Cramer- 
Wold device to show asymptotic normality of the random vector ($ty/Tdr(r), $sVTd T (r)). Let 
M 1)T = 3?M T (s). To apply the martingale central limit theorem we need to verify that the 
variance of T^ 1 / 2 J2d=i ^i,t(s) is finite (which is assumed), Lindeberg's condition is satisfied and 
fE!=i E (%( s ) 2 |^-i) Vr,i (see (Hall & Heyde, 1980), Theorem 3.2). To verify Lindeberg's 
condition, we require that for all 5 > 0, 

1 T 

L T = -EE(M liT ( S ) 2 7(r-V2|M 1)r ( S )| > 5)1^0 4 0, 

s=l 

T — > oo, where /(•) is the identity function and T s = a(Mi j x{s), M 1; r(s — 1), . . . , M 1 ^(l)). By 
using Holder and Markov inequalities, we obtain a bound for the following Lt 

1 T 

^ < (^)-^E E ( M ^( s ) 4 i^-0- ( 41 ) 

s=l 

Now by using Lemma A. 14 we have ^2 (\A T (s, si)| + A T (si, s)|) < oo, therefore 

T T 

supE(iE E ( M ^( s ) 4 ^-i)) = ^supEE(M liT ( S ) 4 ) < oo. 

T 5 = 1 T S = l 
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Since ^ ^J =1 E(Mi i t(s) 4 |.F s _i) is a positive random variable, the above result implies 
h ELi E(M 1iT (s) 4 | J" s _!) = O p (l). Substituting this into (41) gives L T as T -> oo. 

Finally we need to show that 

T T T 

i^E(M 1 , T ( S ) 2 |^_ 1 ) = [E(M liT ( S ) 2 |^-i) -E(M liT (*) 2 )] + ^E(M liT ( S ) 2 ) 4 ^.(42) 

s=l s=l s=l 

Under the stated assumptions, we have j, Y^ s =i ^(^i,t( s ) 2 ) — *■ K,i as T ^ oo. Therefore it remains 
to show 



P t ■= \ E |E(M liT ( S ) 2 |^ s _ 1 ) - E(M liT ( S ) 2 )| ^ 0, 

s=l 

which will give us (42). We will show that E(P^) — > 0. To do this we note that E(P T ) = and 

T T 

var(P T ) = i^^var(E(M 1 , T ( S ) 2 |^ s _ 1 )) + — £ cov(E(M 1 , T ( Sl ) 2 |^ 1 _ 1 ), E(M 1 , T ( S2 ) 2 |^ S2 _ 1 )).(43) 

Now by using the Cauchy Schwartz inequality and conditional expectation arguments for T S2 C T Sr 
we have 

cov(E(M liT (s 1 ) 2 |J- Sl _ 1 ),E(M liT ( S2 ) 2 |^ 2 _ 1 )) 
< [E(E(M 1 , T ( S2 ) 2 |^ S2 _ 1 ) -E(M 1 , r ( S2 ) 2 )) 2 ] 1/2 [E(E(M 1 , r ( Sl ) 2 |^ 2 _ 1 ) -E(M ljT ( Sl ) 2 )) 2 ] 1/2 . 

We now show that sup T Er i=S2 [E(E(M 1iT (s 1 ) 2 |^ S2 _ 1 )-E(M 1iT (s 1 ) 2 )) 2 ] 1/2 < oo. Let Q s = <r(e a ,e s _i, 
then it is clear that for all s, T s C Q s . Therefore, we have E[E(Mi )T (si) 2 |.F S2 _i) 2 ] < E[E(Mi jT (si) 2 |£ S2 
which gives 



E 



E(M lir ( Sl ) 2 |^ S2 _ 1 )-E(M liT ( Sl ) 2 ) 



= E[E(M liT ( Sl ) 2 |^ S2 -i) 2 ] - [E(M liT ( Sl ) 2 )] 2 
< E[E(M 1 , T ( Sl ) 2 |& 2 _ 1 ) 2 ] - [E(M liT ( Sl ) 2 )] 2 . 



Expanding Mi i t(si) in terms of {e t } and using sup Tt \ipt,T(j)\ < oo, it can be shown that 
E[E(M liT (s 1 ) 2 |^ 2 _ 1 ) 2 ] - [E(M liT ( Sl ) 2 )] 2 -> as Sl -> oo, and 

T T 

var(P T ) <-J]sup^ (E[E(M 1 . T ( Sl ) 2 |a 2 -i) 2 ] - [E(M 2 )] 2 ) 1/2 < oo. 

S2 = l Si=S2 

Substituting the above into (44) we have var(P T ) = 0(T _1 ), hence we have shown (42), and the 
conditions of the martingale central limit theorem are satisfied, giving the required result. □ 
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Proof of Theorem 3.2 Using Lemma 3.1 we have 



P I m 



Lemma 3.1, implies that Tvar ^R,c T {j\ ) , . . . , 3or(r m )^ — > diag(l + |/t4(/9(Y), • • • , diag(l+|/«4</7(^ L )). 
Combining this with a similar proof to the proof of Theorem A. 2, gives 



^{tt^mE-)^- • - x + ^(^ H - A ' (0 - /2 '» ) - (44) 

Finally, since m(6 + ^) + + fT 1 ' 2 ) (yi, + 5^5) - 0, using (44) we have (9). □ 

PROOF of Theorem 4.2. The proof is identical to the proof of Theorem 3.2. Hence we omit the 
details. □ 
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